Overview
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 677 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 58.2 KiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 2 |
Alerts
| Alert | Type |
|---|---|
| Age is highly correlated with AgeCategory and Pregnancies | High correlation |
| AgeCategory is highly correlated with Age and Pregnancies | High correlation |
| Insulin is highly correlated with SkinThickness | High correlation |
| Pregnancies is highly correlated with Age and AgeCategory | High correlation |
| SkinThickness is highly correlated with Insulin | High correlation |
| Pregnancies has 92 (13.6%) zeros | Zeros |
| SkinThickness has 183 (27.0%) zeros | Zeros |
| Insulin has 323 (47.7%) zeros | Zeros |
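The percentages in the zero alerts listed above follow directly from the zero counts and the 677 observations reported in the dataset statistics. A minimal Python sketch that recomputes them (the dictionary and variable names are illustrative and not part of the report):

```python
# Zero counts copied from the alerts; 677 is the reported number of observations.
N_OBSERVATIONS = 677
zero_counts = {"Pregnancies": 92, "SkinThickness": 183, "Insulin": 323}

# Share of zeros per variable, rounded to one decimal place as in the report.
zero_pct = {
    name: round(100 * count / N_OBSERVATIONS, 1)
    for name, count in zero_counts.items()
}

for name, pct in zero_pct.items():
    print(f"{name}: {pct}% zeros")
```

The same division (count / 677) gives every value in the Frequency (%) columns of the per-variable tables.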
Reproduction
| Analysis started | 2024-11-09 14:33:24.132211 |
|---|---|
| Analysis finished | 2024-11-09 14:33:45.469250 |
| Duration | 21.34 seconds |
| Software version | ydata-profiling v4.12.0 |
| Download configuration | config.json |
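A report like this one can be regenerated with ydata-profiling. The sketch below is a minimal example, not the configuration that was actually used: the file name `diabetes.csv`, the report title, and the output path are assumptions, and it needs `pandas` and `ydata-profiling` installed.

```python
def build_report(csv_path="diabetes.csv", out_path="report.html"):
    """Profile a CSV file and write an HTML report (both paths are hypothetical)."""
    # Imported lazily so the function can be defined without the packages installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(df, title="Dataset profile")
    profile.to_file(out_path)  # writes the HTML report to out_path
    return out_path
```

Calling `build_report()` writes `report.html` to the working directory. The figures may differ from those shown here if the input data or the library version differs.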
Variables
Pregnancies
Real number (ℝ)
Alerts: High correlation, Zeros
| Distinct | 17 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8714919 |
| Minimum | 0 |
| Maximum | 17 |
| Zeros | 92 |
| Zeros (%) | 13.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.3736316 |
|---|---|
| Coefficient of variation (CV) | 0.87140351 |
| Kurtosis | 0.20127263 |
| Mean | 3.8714919 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.91557613 |
| Sum | 2621 |
| Variance | 11.38139 |
| Monotonicity | Not monotonic |
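Several of the Pregnancies statistics are tied together by simple identities (mean = sum / n, variance = standard deviation squared, CV = standard deviation / mean, IQR = Q3 - Q1, range = maximum - minimum). The following Python sketch checks those identities against the values printed in the tables; the variable names are illustrative and the tolerances are an assumption chosen to absorb the report's rounding.

```python
import math

# Values copied from the Pregnancies tables in this report.
n = 677                      # number of observations
total = 2621                 # Sum
mean = 3.8714919
std = 3.3736316
variance = 11.38139
cv = 0.87140351
q1, q3 = 1, 6
minimum, maximum = 0, 17

# Each entry is True when the printed value matches the identity.
checks = {
    "mean = sum / n": math.isclose(total / n, mean, rel_tol=1e-6),
    "variance = std^2": math.isclose(std ** 2, variance, rel_tol=1e-5),
    "CV = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "IQR = Q3 - Q1": q3 - q1 == 5,
    "range = max - min": maximum - minimum == 17,
}

for name, ok in checks.items():
    print(f"{name}: {'ok' if ok else 'mismatch'}")
```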
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 123 | 18.2% |
| 0 | 92 | 13.6% |
| 2 | 90 | 13.3% |
| 3 | 67 | 9.9% |
| 4 | 61 | 9.0% |
| 5 | 51 | 7.5% |
| 6 | 45 | 6.6% |
| 7 | 39 | 5.8% |
| 8 | 31 | 4.6% |
| 9 | 25 | 3.7% |
| Other values (7) | 53 | 7.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 92 | 13.6% |
| 1 | 123 | 18.2% |
| 2 | 90 | 13.3% |
| 3 | 67 | 9.9% |
| 4 | 61 | 9.0% |
| 5 | 51 | 7.5% |
| 6 | 45 | 6.6% |
| 7 | 39 | 5.8% |
| 8 | 31 | 4.6% |
| 9 | 25 | 3.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 17 | 1 | 0.1% |
| 15 | 1 | 0.1% |
| 14 | 2 | 0.3% |
| 13 | 8 | 1.2% |
| 12 | 9 | 1.3% |
| 11 | 10 | 1.5% |
| 10 | 22 | 3.2% |
| 9 | 25 | 3.7% |
| 8 | 31 | 4.6% |
| 7 | 39 | 5.8% |
Glucose
Real number (ℝ)
| Distinct | 134 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 118.96455 |
| Minimum | 0 |
| Maximum | 199 |
| Zeros | 5 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 78 |
| Q1 | 99 |
| median | 114 |
| Q3 | 137 |
| 95-th percentile | 179 |
| Maximum | 199 |
| Range | 199 |
| Interquartile range (IQR) | 38 |
Descriptive statistics
| Standard deviation | 31.293352 |
|---|---|
| Coefficient of variation (CV) | 0.26304771 |
| Kurtosis | 0.92098212 |
| Mean | 118.96455 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | 0.16841553 |
| Sum | 80539 |
| Variance | 979.27389 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 100 | 16 | 2.4% |
| 99 | 15 | 2.2% |
| 106 | 14 | 2.1% |
| 111 | 14 | 2.1% |
| 108 | 13 | 1.9% |
| 112 | 13 | 1.9% |
| 95 | 13 | 1.9% |
| 125 | 13 | 1.9% |
| 122 | 12 | 1.8% |
| 109 | 12 | 1.8% |
| Other values (124) | 542 | 80.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5 | 0.7% |
| 44 | 1 | 0.1% |
| 56 | 1 | 0.1% |
| 57 | 1 | 0.1% |
| 61 | 1 | 0.1% |
| 62 | 1 | 0.1% |
| 65 | 1 | 0.1% |
| 67 | 1 | 0.1% |
| 68 | 3 | 0.4% |
| 71 | 4 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 199 | 1 | 0.1% |
| 198 | 1 | 0.1% |
| 197 | 1 | 0.1% |
| 196 | 3 | 0.4% |
| 195 | 2 | 0.3% |
| 194 | 2 | 0.3% |
| 193 | 1 | 0.1% |
| 191 | 1 | 0.1% |
| 190 | 1 | 0.1% |
| 189 | 2 | 0.3% |
BloodPressure
Real number (ℝ)
| Distinct | 40 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 72.088626 |
| Minimum | 38 |
| Maximum | 106 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 38 |
|---|---|
| 5-th percentile | 54 |
| Q1 | 64 |
| median | 72 |
| Q3 | 80 |
| 95-th percentile | 90 |
| Maximum | 106 |
| Range | 68 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 11.396737 |
|---|---|
| Coefficient of variation (CV) | 0.15809342 |
| Kurtosis | -0.029933566 |
| Mean | 72.088626 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.068647995 |
| Sum | 48804 |
| Variance | 129.88562 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 70 | 51 | 7.5% |
| 74 | 50 | 7.4% |
| 72 | 44 | 6.5% |
| 78 | 43 | 6.4% |
| 68 | 43 | 6.4% |
| 64 | 40 | 5.9% |
| 76 | 37 | 5.5% |
| 80 | 37 | 5.5% |
| 60 | 34 | 5.0% |
| 62 | 32 | 4.7% |
| Other values (30) | 266 | 39.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 38 | 1 | 0.1% |
| 40 | 1 | 0.1% |
| 44 | 4 | 0.6% |
| 46 | 1 | 0.1% |
| 48 | 5 | 0.7% |
| 50 | 11 | 1.6% |
| 52 | 10 | 1.5% |
| 54 | 11 | 1.6% |
| 55 | 2 | 0.3% |
| 56 | 12 | 1.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 106 | 3 | 0.4% |
| 104 | 2 | 0.3% |
| 102 | 1 | 0.1% |
| 100 | 2 | 0.3% |
| 98 | 2 | 0.3% |
| 96 | 3 | 0.4% |
| 95 | 1 | 0.1% |
| 94 | 6 | 0.9% |
| 92 | 8 | 1.2% |
| 90 | 19 | 2.8% |
SkinThickness
Real number (ℝ)
Alerts: High correlation, Zeros
| Distinct | 48 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.939439 |
| Minimum | 0 |
| Maximum | 60 |
| Zeros | 183 |
| Zeros (%) | 27.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 23 |
| Q3 | 32 |
| 95-th percentile | 43 |
| Maximum | 60 |
| Range | 60 |
| Interquartile range (IQR) | 32 |
Descriptive statistics
| Standard deviation | 15.276665 |
|---|---|
| Coefficient of variation (CV) | 0.72956422 |
| Kurtosis | -1.1423698 |
| Mean | 20.939439 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.10826787 |
| Sum | 14176 |
| Variance | 233.3765 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 183 | 27.0% |
| 32 | 31 | 4.6% |
| 30 | 24 | 3.5% |
| 27 | 22 | 3.2% |
| 28 | 20 | 3.0% |
| 18 | 19 | 2.8% |
| 23 | 18 | 2.7% |
| 31 | 18 | 2.7% |
| 19 | 17 | 2.5% |
| 39 | 17 | 2.5% |
| Other values (38) | 308 | 45.5% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 183 | 27.0% |
| 7 | 1 | 0.1% |
| 8 | 2 | 0.3% |
| 10 | 5 | 0.7% |
| 11 | 6 | 0.9% |
| 12 | 7 | 1.0% |
| 13 | 10 | 1.5% |
| 14 | 5 | 0.7% |
| 15 | 14 | 2.1% |
| 16 | 5 | 0.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 60 | 1 | 0.1% |
| 54 | 2 | 0.3% |
| 52 | 2 | 0.3% |
| 51 | 1 | 0.1% |
| 50 | 3 | 0.4% |
| 49 | 2 | 0.3% |
| 48 | 3 | 0.4% |
| 47 | 4 | 0.6% |
| 46 | 7 | 1.0% |
| 45 | 5 | 0.7% |
Insulin
Real number (ℝ)
Alerts: High correlation, Zeros
| Distinct | 154 |
|---|---|
| Distinct (%) | 22.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 67.895126 |
| Minimum | 0 |
| Maximum | 325 |
| Zeros | 323 |
| Zeros (%) | 47.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 40 |
| Q3 | 120 |
| 95-th percentile | 230 |
| Maximum | 325 |
| Range | 325 |
| Interquartile range (IQR) | 120 |
Descriptive statistics
| Standard deviation | 82.216534 |
|---|---|
| Coefficient of variation (CV) | 1.2109343 |
| Kurtosis | 0.21609085 |
| Mean | 67.895126 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 1.0469727 |
| Sum | 45965 |
| Variance | 6759.5585 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 323 | 47.7% |
| 105 | 11 | 1.6% |
| 140 | 9 | 1.3% |
| 120 | 8 | 1.2% |
| 130 | 8 | 1.2% |
| 180 | 7 | 1.0% |
| 94 | 7 | 1.0% |
| 135 | 6 | 0.9% |
| 100 | 6 | 0.9% |
| 110 | 6 | 0.9% |
| Other values (144) | 286 | 42.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 323 | 47.7% |
| 15 | 1 | 0.1% |
| 16 | 1 | 0.1% |
| 18 | 2 | 0.3% |
| 22 | 1 | 0.1% |
| 23 | 2 | 0.3% |
| 29 | 1 | 0.1% |
| 32 | 1 | 0.1% |
| 36 | 3 | 0.4% |
| 37 | 2 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 325 | 3 | 0.4% |
| 321 | 1 | 0.1% |
| 318 | 1 | 0.1% |
| 310 | 1 | 0.1% |
| 304 | 1 | 0.1% |
| 300 | 1 | 0.1% |
| 293 | 2 | 0.3% |
| 291 | 1 | 0.1% |
| 285 | 2 | 0.3% |
| 284 | 1 | 0.1% |
BMI
Real number (ℝ)
| Distinct | 233 |
|---|---|
| Distinct (%) | 34.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.115657 |
| Minimum | 18.2 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 18.2 |
|---|---|
| 5-th percentile | 22.28 |
| Q1 | 27.4 |
| median | 32 |
| Q3 | 36.3 |
| 95-th percentile | 43.52 |
| Maximum | 50 |
| Range | 31.8 |
| Interquartile range (IQR) | 8.9 |
Descriptive statistics
| Standard deviation | 6.4257519 |
|---|---|
| Coefficient of variation (CV) | 0.20008159 |
| Kurtosis | -0.34146155 |
| Mean | 32.115657 |
| Median Absolute Deviation (MAD) | 4.5 |
| Skewness | 0.26666499 |
| Sum | 21742.3 |
| Variance | 41.290287 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 32 | 13 | 1.9% |
| 31.6 | 12 | 1.8% |
| 31.2 | 11 | 1.6% |
| 33.3 | 10 | 1.5% |
| 30.8 | 9 | 1.3% |
| 32.8 | 9 | 1.3% |
| 32.4 | 9 | 1.3% |
| 33.6 | 8 | 1.2% |
| 32.9 | 8 | 1.2% |
| 34.2 | 8 | 1.2% |
| Other values (223) | 580 | 85.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 18.2 | 3 | 0.4% |
| 18.4 | 1 | 0.1% |
| 19.1 | 1 | 0.1% |
| 19.3 | 1 | 0.1% |
| 19.4 | 1 | 0.1% |
| 19.5 | 2 | 0.3% |
| 19.6 | 1 | 0.1% |
| 19.9 | 1 | 0.1% |
| 20 | 1 | 0.1% |
| 20.1 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 50 | 1 | 0.1% |
| 49.7 | 1 | 0.1% |
| 49.6 | 1 | 0.1% |
| 49.3 | 1 | 0.1% |
| 48.3 | 1 | 0.1% |
| 47.9 | 2 | 0.3% |
| 46.8 | 2 | 0.3% |
| 46.7 | 1 | 0.1% |
| 46.5 | 1 | 0.1% |
| 46.3 | 1 | 0.1% |
DiabetesPedigreeFunction
Real number (ℝ)
| Distinct | 469 |
|---|---|
| Distinct (%) | 69.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.46666765 |
| Minimum | 0.078 |
| Maximum | 2.288 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 0.078 |
|---|---|
| 5-th percentile | 0.1408 |
| Q1 | 0.245 |
| median | 0.371 |
| Q3 | 0.614 |
| 95-th percentile | 1.1364 |
| Maximum | 2.288 |
| Range | 2.21 |
| Interquartile range (IQR) | 0.369 |
Descriptive statistics
| Standard deviation | 0.31562672 |
|---|---|
| Coefficient of variation (CV) | 0.67634155 |
| Kurtosis | 3.3831264 |
| Mean | 0.46666765 |
| Median Absolute Deviation (MAD) | 0.164 |
| Skewness | 1.6017738 |
| Sum | 315.934 |
| Variance | 0.099620228 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.254 | 6 | 0.9% |
| 0.258 | 5 | 0.7% |
| 0.268 | 5 | 0.7% |
| 0.259 | 5 | 0.7% |
| 0.207 | 4 | 0.6% |
| 0.299 | 4 | 0.6% |
| 0.263 | 4 | 0.6% |
| 0.26 | 4 | 0.6% |
| 0.692 | 4 | 0.6% |
| 0.261 | 4 | 0.6% |
| Other values (459) | 632 | 93.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.078 | 1 | 0.1% |
| 0.084 | 1 | 0.1% |
| 0.085 | 2 | 0.3% |
| 0.088 | 2 | 0.3% |
| 0.089 | 1 | 0.1% |
| 0.092 | 1 | 0.1% |
| 0.096 | 1 | 0.1% |
| 0.1 | 1 | 0.1% |
| 0.101 | 1 | 0.1% |
| 0.107 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 2.288 | 1 | 0.1% |
| 1.893 | 1 | 0.1% |
| 1.781 | 1 | 0.1% |
| 1.699 | 1 | 0.1% |
| 1.698 | 1 | 0.1% |
| 1.6 | 1 | 0.1% |
| 1.476 | 1 | 0.1% |
| 1.461 | 1 | 0.1% |
| 1.441 | 1 | 0.1% |
| 1.4 | 1 | 0.1% |
Age
Real number (ℝ)
Alerts: High correlation
| Distinct | 46 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.868538 |
| Minimum | 21 |
| Maximum | 66 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 24 |
| median | 29 |
| Q3 | 40 |
| 95-th percentile | 56 |
| Maximum | 66 |
| Range | 45 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 11.091557 |
|---|---|
| Coefficient of variation (CV) | 0.33745209 |
| Kurtosis | 0.16679676 |
| Mean | 32.868538 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.99712901 |
| Sum | 22252 |
| Variance | 123.02263 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 22 | 66 | 9.7% |
| 21 | 56 | 8.3% |
| 24 | 42 | 6.2% |
| 25 | 40 | 5.9% |
| 23 | 32 | 4.7% |
| 28 | 31 | 4.6% |
| 27 | 30 | 4.4% |
| 26 | 28 | 4.1% |
| 29 | 26 | 3.8% |
| 31 | 20 | 3.0% |
| Other values (36) | 306 | 45.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 21 | 56 | 8.3% |
| 22 | 66 | 9.7% |
| 23 | 32 | 4.7% |
| 24 | 42 | 6.2% |
| 25 | 40 | 5.9% |
| 26 | 28 | 4.1% |
| 27 | 30 | 4.4% |
| 28 | 31 | 4.6% |
| 29 | 26 | 3.8% |
| 30 | 18 | 2.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 66 | 4 | 0.6% |
| 65 | 2 | 0.3% |
| 64 | 1 | 0.1% |
| 63 | 4 | 0.6% |
| 62 | 3 | 0.4% |
| 61 | 2 | 0.3% |
| 60 | 3 | 0.4% |
| 59 | 2 | 0.3% |
| 58 | 7 | 1.0% |
| 57 | 5 | 0.7% |
Outcome
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 453 | 66.9% |
| 1 | 224 | 33.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 453 | 66.9% |
| 1 | 224 | 33.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 677 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 453 | 66.9% |
| 1 | 224 | 33.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 677 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 453 | 66.9% |
| 1 | 224 | 33.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 677 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 453 | 66.9% |
| 1 | 224 | 33.1% |
AgeCategory
Categorical
Alerts: High correlation
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.6 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 441 | 65.1% |
| 0 | 236 | 34.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 441 | 65.1% |
| 0 | 236 | 34.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 677 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 441 | 65.1% |
| 0 | 236 | 34.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 677 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 441 | 65.1% |
| 0 | 236 | 34.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 677 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 441 | 65.1% |
| 0 | 236 | 34.9% |
Interactions
Correlations
| | Age | AgeCategory | BMI | BloodPressure | DiabetesPedigreeFunction | Glucose | Insulin | Outcome | Pregnancies | SkinThickness |
|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.994 | 0.142 | 0.382 | 0.038 | 0.288 | -0.112 | 0.348 | 0.621 | -0.064 |
| AgeCategory | 0.994 | 1.000 | 0.142 | 0.364 | 0.020 | 0.256 | 0.225 | 0.271 | 0.531 | 0.272 |
| BMI | 0.142 | 0.142 | 1.000 | 0.304 | 0.140 | 0.193 | 0.169 | 0.293 | 0.016 | 0.445 |
| BloodPressure | 0.382 | 0.364 | 0.304 | 1.000 | 0.018 | 0.257 | -0.076 | 0.160 | 0.186 | 0.056 |
| DiabetesPedigreeFunction | 0.038 | 0.020 | 0.140 | 0.018 | 1.000 | 0.071 | 0.224 | 0.247 | -0.031 | 0.160 |
| Glucose | 0.288 | 0.256 | 0.193 | 0.257 | 0.071 | 1.000 | 0.168 | 0.483 | 0.148 | 0.019 |
| Insulin | -0.112 | 0.225 | 0.169 | -0.076 | 0.224 | 0.168 | 1.000 | 0.250 | -0.138 | 0.502 |
| Outcome | 0.348 | 0.271 | 0.293 | 0.160 | 0.247 | 0.483 | 0.250 | 1.000 | 0.254 | 0.210 |
| Pregnancies | 0.621 | 0.531 | 0.016 | 0.186 | -0.031 | 0.148 | -0.138 | 0.254 | 1.000 | -0.080 |
| SkinThickness | -0.064 | 0.272 | 0.445 | 0.056 | 0.160 | 0.019 | 0.502 | 0.210 | -0.080 | 1.000 |
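The high-correlation alerts in the overview can be recovered from this matrix. The Python sketch below lists every variable pair whose absolute coefficient reaches a cut-off. The cut-off of 0.5 is an assumption: the report does not state the threshold it used, but 0.5 reproduces exactly the pairs named in the alerts.

```python
# Variable order and coefficients copied from the correlation table above.
names = ["Age", "AgeCategory", "BMI", "BloodPressure", "DiabetesPedigreeFunction",
         "Glucose", "Insulin", "Outcome", "Pregnancies", "SkinThickness"]
matrix = [
    [1.000, 0.994, 0.142, 0.382, 0.038, 0.288, -0.112, 0.348, 0.621, -0.064],
    [0.994, 1.000, 0.142, 0.364, 0.020, 0.256, 0.225, 0.271, 0.531, 0.272],
    [0.142, 0.142, 1.000, 0.304, 0.140, 0.193, 0.169, 0.293, 0.016, 0.445],
    [0.382, 0.364, 0.304, 1.000, 0.018, 0.257, -0.076, 0.160, 0.186, 0.056],
    [0.038, 0.020, 0.140, 0.018, 1.000, 0.071, 0.224, 0.247, -0.031, 0.160],
    [0.288, 0.256, 0.193, 0.257, 0.071, 1.000, 0.168, 0.483, 0.148, 0.019],
    [-0.112, 0.225, 0.169, -0.076, 0.224, 0.168, 1.000, 0.250, -0.138, 0.502],
    [0.348, 0.271, 0.293, 0.160, 0.247, 0.483, 0.250, 1.000, 0.254, 0.210],
    [0.621, 0.531, 0.016, 0.186, -0.031, 0.148, -0.138, 0.254, 1.000, -0.080],
    [-0.064, 0.272, 0.445, 0.056, 0.160, 0.019, 0.502, 0.210, -0.080, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report

# Walk the upper triangle so each pair is reported once.
high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

for a, b, r in high_pairs:
    print(f"{a} ~ {b}: {r:.3f}")
```

Under this assumption the strongest relationship with Outcome, Glucose at 0.483, falls just below the cut-off and is not flagged.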
Missing values
Sample
First rows
| | Pregnancies | Glucose | BloodPressure | SkinThickness | Insulin | BMI | DiabetesPedigreeFunction | Age | Outcome | AgeCategory |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 6 | 148 | 72 | 35 | 0 | 33.6 | 0.627 | 50 | 1 | 1 |
| 1 | 1 | 85 | 66 | 29 | 0 | 26.6 | 0.351 | 31 | 0 | 1 |
| 2 | 8 | 183 | 64 | 0 | 0 | 23.3 | 0.672 | 32 | 1 | 1 |
| 3 | 1 | 89 | 66 | 23 | 94 | 28.1 | 0.167 | 21 | 0 | 0 |
| 4 | 0 | 137 | 40 | 35 | 168 | 43.1 | 2.288 | 33 | 1 | 1 |
| 5 | 5 | 116 | 74 | 0 | 0 | 25.6 | 0.201 | 30 | 0 | 1 |
| 6 | 3 | 78 | 50 | 32 | 88 | 31.0 | 0.248 | 26 | 1 | 1 |
| 10 | 4 | 110 | 92 | 0 | 0 | 37.6 | 0.191 | 30 | 0 | 1 |
| 11 | 10 | 168 | 74 | 0 | 0 | 38.0 | 0.537 | 34 | 1 | 1 |
| 12 | 10 | 139 | 80 | 0 | 0 | 27.1 | 1.441 | 57 | 0 | 1 |
Last rows
| | Pregnancies | Glucose | BloodPressure | SkinThickness | Insulin | BMI | DiabetesPedigreeFunction | Age | Outcome | AgeCategory |
|---|---|---|---|---|---|---|---|---|---|---|
| 758 | 1 | 106 | 76 | 0 | 0 | 37.5 | 0.197 | 26 | 0 | 1 |
| 759 | 6 | 190 | 92 | 0 | 0 | 35.5 | 0.278 | 66 | 1 | 1 |
| 760 | 2 | 88 | 58 | 26 | 16 | 28.4 | 0.766 | 22 | 0 | 0 |
| 761 | 9 | 170 | 74 | 31 | 0 | 44.0 | 0.403 | 43 | 1 | 1 |
| 762 | 9 | 89 | 62 | 0 | 0 | 22.5 | 0.142 | 33 | 0 | 1 |
| 763 | 10 | 101 | 76 | 48 | 180 | 32.9 | 0.171 | 63 | 0 | 1 |
| 764 | 2 | 122 | 70 | 27 | 0 | 36.8 | 0.340 | 27 | 0 | 1 |
| 765 | 5 | 121 | 72 | 23 | 112 | 26.2 | 0.245 | 30 | 0 | 1 |
| 766 | 1 | 126 | 60 | 0 | 0 | 30.1 | 0.349 | 47 | 1 | 1 |
| 767 | 1 | 93 | 70 | 31 | 0 | 30.4 | 0.315 | 23 | 0 | 0 |